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ABSTRACT 


A brief resume of the role of linear utility functions in Came 
Theory is given. The point is made that, in practical applications, a 
knowledge of these functions is usually not available, and hence much 
of the rationale of the game theoretic approach to competitive problems 
is lost. The random character of the "real" payoff of a matrix game is 
then discussed, and the probability distribution function of the payoff 
is derived. The dependence of this distribution function upon the mixed 
strategies of the players is shown. Criteria are developed to provide 
definition of "optimal mixed strategy” in terms of the effect on the 
distribution function. The mathematical formulation of the solution is 
given for each criterion discussed, 

The reader will require knowledge of the elements of Probability 


Theory, Game Theory, Utility Theory, and Linear Programming. 
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1 = =Introcuction. 

Tne purpose Of this paper is to investigate a difficulty that inevit- 
ably exists whenever an application of the mathematical Theory of Cames 
is made in a non-idealized situation. We shall restrict our attention to 
the class of two-person matrix games, although the basic question arises 
in connection with n-person and continuous games as well. Briefly stated, 
the problem is as follows: 

In the logic of the Theory of Games, a vital part is played by the 
so-called "linear" utility function, a real-valued function defined over 
the space of outcomes associated with a game. In actual practice, it is 
very doubtful that this function will be available to, let us say, a mil- 
itary operations analyst who wishes to formulate a practical problem as 
a game. On the other hand, when the situation involves a conflict of in- 
terest between intelligent antagonists, with an element of chance affect- 
ing the outcome of any course of action, .t is difficult to find a concept 
of "rational procedure" or "optimal decision" which does not involve the 
game theoretic approach. Therefore the qu:stion becomes: in the absence 
of an important part of its machinery, what adaptations of Game Theory 
can provide us with reasonable solutions to problems of practical impor- 
tance? 

In standard texts on the Theory of Games, it is usual to present 4 
discussion of utility theory at an early stage. The axioms leading to 
construction of the linear utility function (hereafter referred to as 
the l.u.f.) are developed, and once this has been done, it is customary 
to assume that the appropriate l.u.f.'s are known, and proceed with the 


mathematical development of the theory. 





We wish to change this procedure slightly, in order to do two things: 

(1) Clarify the theoretical role of the i.u.f. in the logic of Game 
Theory. 

(2) Emphasize the vital importance to the military operations analyst 
of a knowledge of appropriate l.u.f.'s, when formulating "real" problems 


in game-theoretic terms. 


2. Typical formulation of a competitive situation as a matrix game, 
Distribution of the payoff. 


Suppose that a situation involving conflict of interest between two 
parties ("player I" and "player II") has been analysed so as to yield the 
following quantities; 

(1) The space of all pure strategies available to player I. 

(2) The space of all pure strategies available to player II. 

(3) A matrix, the elements of which describe what happens when play- 
er I follows his ith pure strategy, and player II follows his jth pure 
strategy (i= 1,2,...m; j= 1,2,...n). 

We shall assume that the outcome, or payoff to player I, resulting 
from the choice (i,j) of pure strategies, can be measured in terms of a 
single "real commodity", e.g., money, or casualties, etc.! 

The first important point we make here is that nothing in Game 
Theory requires that a fixed real-commodity payoff to player I be agsoc- 
fated with the pair (i,j). Rather, the requirement is this: the payoff 
to rater I resulting from the choice (i,j) is a random variable Xi 4; 
the distribution function of which is known to both players. Denote this 
distribution function by Fy 4 (x) = P[X 45 < x] » This leads us to the 


l. This restriction simplifies the notation without affecting the logic 
of the argument. 
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following Definition: F(x) = |/?14 G0) is & matrix of known distrib- 
ution functions. This matrix describes in probabilistic terms the pay- 
off to player I associated with each possible choice (i,j) of pure strat- 


egies by the players. 


Now assume that the players wish to employ mixed strategies. 

Q' = (q),42,-+-4m) i8 & mixed strategy of player I. 

P' = (pj,P2,->+P,) is a mixed strategy of player II. 

When the mixed strategies Q and P are used, we can determine the 
probability distribution function of the payoff to player I as follows: 

Define Y to denote the real commodity payoff to player I when the 
mixed strategies Q and P are used in conjunction with the matrix F(x). 


Then 


= 
G(y) = Pl¥=<y] = fy PIY = Xay] PLXiy=y], tee., 

a 
Gly) = Fy WPyFigO) - (eq. 1) 


This can be written in matrix notetion as: 


Gly) = Q'F(y)P . (eq. 2) 


Looking at equations 1 and 2, we can see that the distribution of 
the “real" payoff to player I involves two types of parameters: (1) those 
associated with the distributions Py g(x), which are assumed given and un- 
changeable, and (2) the elements q;, Pj of the matrices Q and P. These 
are under the control of the players. 

We can now formulate the objective of player I in choosing his mixed 
strategy Q. Player I wishes to choose Q s0 as to optimize, in some sense, 
the form of the distribution function G(y). This interpretation of the 


role of a mixed strategy is a central feature of this paper. 


3 





ac this point, the theory ef the 1.0. f emtere the pietur2. In the 
next section we see how the l.u.f. provides a criterion whereby each player 
way make a pregereatial ordering of his possible mixed strategies, and 


select an optimal one. 


om The theoretical and practical roles of the 1.u.f. 


Appendix A presents for quick reference the axiomatic development of 
the l.u.f., largely following the excellent treatment in Luce and Raiffa 
[2] . Appendix B gives a very slight extension of the idea, primarily so 
that we can talk about a continuous 1l.u.f£. u(x) defined over an “infinitely 
divisible" real commodity”. 

In this section, we do no more than write down some conclusions and 
interpretations concerning the l.u.f. Appendices A and B are, hopefully, 
arranged so that the reader who wishes to check the reasoning behind the 
statements made here may do so easily. 

The pertinent conclusions concerning the l.u.f. are the following: 

(1) The axiomatic design of a player's l.u.f. is such that it calls 
on information not available in the matrices F(x), Q, and P, in order to 
measure the true value that a real commodity has for the player, vis-a-vis 
the gambles involved in a risky sitwation (i.e., the game). 

(2) Suppose a player enters a gamble of the following form: he will 
win (or lose) a random amount X of some commodity, where X has the known 
distribution function H(x). This is called a “simple” gamble. If the 
player has a l1.u.f. u(x) defined over the commodity, he can compute the 


true value v which the simple gamble has for him by forming the 


2. The notation tends to be simpler when discussing this case. 


4 
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expectation 
v = Elu(X)] = (n(x) dH(x) ; 


(3) Suppose the player is involved in a “compound" gamble; that is, 
with probability a. he enters a simple gamble (x), k = 1,2,....n. 
The true value v to the player of this compound gamble can be computed in 


terms of his l.u.f. as?: 


bray 2 he 
v = E(u(x)] = (ucanaly a, m6) = hap ) 8 (x) die) . 

(4) Even though the operation involved in (2) and (3) above is that 
of taking an expected value of u(X), the interpretation of v as a "true 
value" does not depend upon there being many repetitions of the gamble. 
The l.u.f. is so designed that it measures the value of a one-time gamble. 

(5) Once a player has determined what his l.u.f. u(x) is for a cer- 
tain commodity x, he can use u(x) to compare any two gambles involving x. 
If Hj(x) and Ho(x) are the distribution functions associated with any two 


such gambles, simple or compound, the player computes 


00 
Vv, = j8Goait (=) and Vo * {ucayan (wo ' 
The gamble involving H, (x) is preferred to the gamble involving H, (x) if 
and only if V1>Vo- By using u(x) in this way, the player obtains a com- 
plete and transitive ordering of his preference for all possible gambles 
over the commodity x. 
These are the five conclusions which are of most interest to us. 

Next, we make some interpretations, based on these conclusions. 


1. This property of u(x) contributes the unfortunate word "linear" to 
the name of the l.u.f. 
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(1) When the players in a two-person matrix game make a choice of 
pure strategies, a "simple" gamble results. From player I's point of view, 
this gamble involves the random variable Xi 4 » with distribution function 
Fy 4 (x). 

(2) When the players employ mixed strategies Q and P, player I faces 
a “compound” gamble involving the random variable Y with distribution 
function G(y), as given by equation 1. 

(3) Recall our earlier observation that by his choice of a mixed 
strategy Q, player I exerts himself to determine a form of G(y) which is 
in some sense optimal for him. We now interpret the theoretical role of the 
l.u.f. in the logic of Game Theory to be: the l.u.f. is a criterion- 
providing device that enables a player to compute which form of G(y), and 
hence which mixed strategy, is best for hin. 

(4) Suppose that a matrix game involves a real commodity x for which 
the players know their l.u.f.'s to be u, (x) and u9(x) respectively. Suppose 
further that u, (x) = @u, (x). This is of course the so-called "zero-sum" 
assumption. If the players use mixed strategies Q and P, we write the 
distribution of the real payoff Y as G(y) = G(y;Q,P) to emphasize the depen- 
dence of this function upon the players’ choices. It now follows from the 
discussion under conclusion (5) above, that for extreme conservatism the 
optimal choices Q) = Ch o0eeae) and PS ~ (Pls e+e ePa) of mixed strategies 
are defined by the equation 

oo 


u(y) 46(¥3 QB.) = max min ) w(y)46(y5Q,P) 
“00 Q P 


“GO 


(eq. 3) 
= min max \ u(y)€6(3@P) ‘ 
Ps oO. 


Written in this form, the definition of optimal Q,? Po looks unfamiliar 
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and rather formidable. However, referring to equation 1, we can re-write 
equation 3 as: 
> gp (% : 
,j 1°3 \ ucy)aPy sy? = max min GP, \ucyar, 5) 
#00 Q r i,j #00 
(eq. 4) 


The "linear" property of u(x) discussed under conclusion (3) above makes 


it proper for us to define 


co 
ai = (u (y) dF; 5 (y), 


“00 
and to re-write equation 4 as 
re) 
ys qiP 584 5 = max min ml 44P 5945: (eq. 5) 
i,j Q P i,j 
By introducing the utility matrix A = | Cag 9) || » we can reduce equation 5 
to the familiar form: 
QiaP = max min Q'AP = min max Q’AP . (eq. 6) 
- Q P PQ 


Thus we have seen, step by step, how knowledge of a player's l.u.f. 
for a commodity x lies at the center of the famous minimax formulation of 
the zero-sum game solution, as given in equation 6. The essential obser- 
vation to make is: if the l.u.f.‘s are not known, the rationale leading 
to equation 6 as a definition of optimality breaks down, for there can be 
no guarantee that the players are maximizing (minimizing) the "true value" 
of the compound gamble represented by the game. In particular, it is not 
in general correct to replace a; j by the quantity m, } * ( AF, (x), and 
to maximize (minimize) the expected real value of the game. To do so 


ignores the entire scheme for evaluating gambles that is provided by the 








conetruction of the l.w.f. 

This completes our review of the role of the l.u.f. and its practical 
importance to the operations analyst who wishes to use the "customary" 
approach to the solution of a@ matrix game, as represented in the zero-sum 
case by equation 6. 

In the next section, we discuss procedures for handling a competitive 
situation when a knowledge of l.u.f.'s is not available, It seems approp- 
riate, however, to first answer the question, "Why not devote the effort 
instead to the problem of determining the l.u.f. experimentally, rather than 
adopting the pessimistic attitude that appropriate l.u.f.'sa are not likely 
to be known?" 

The answer, of course, is that a great deal of work has been done in 
an effort to weasure representative l.u.f.'s, and the results have been 
largely discouraging, not to say unbelievable, See, for instance, [3] 
and [4]. Since experiments performed under controlled and simplified 
laboratory conditions have failed to yield satisfactory results, it does 
not seem reasonable to assume that the military operations researcher, 
working with far more complex problems of value, will be able to isolate 
functions which truly describe how responsible military commanders evaluate 
difficult situations in terms of "linear utilities". As a practical matter, 
it seems much more sensible to look for procedures which make some sense 
in the absence of knowledge about linear utilities. 

One more point should be made before we go on. If we are going to 
abandon all knowledge of utility functions and attempt to “solve” compet- 
itive situations (games) using only the distribution function G(y) as a 
point of departure, we must clearly wnderstand what we can and cannot hope 


to accomplish. 
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Consider the situation in which a matrix game has been formulated, and 
a knowledge of appropriate l.u.f.'s assumed, In this case, if the payoff 
is zero-sum, the assertion of Game Theory is that there is a logical and 
highly plausible definition of "optimum solution" available, and a scheme 
for finding itt. In other words, we are guaranteed the optimum result. 

Contrast this with the situation that exists when no knowledge of 
the l.u.f. is assumed. This lack of knowledge means that we do not know how 
to value the commodity for which we are gambling, vis-a-vis the risks in- 
volved. It is clearly unrealistic to hope that, in this condition of ignor- 
ance, we shall be able to characterize any decision we make as the optimal 
one. We can hope to formulate criteria which may be "plausible" and 
"reasonable" when applied to specific situations, and we can hope that these 
criteria, through their mathematical formulation, will lead to decisions 
which are "approximately optimal", This is of course a very loose state- 
ment, since we do not have, and will not formwlate, a means of measuring 
"distances" between a theoretical optimum and any other point in the space 
of possible solutions. We are simply proceeding in accordance with the be- 
lief that it is better to have some well-understood criterion for the good- 
ness of a decision, and some systematic scheme for computing what that 
decision should be, keeping in mind ail the potential shortcomings, than 


to have none at all and simply proceed hit-or-miss. 


4. Criteria for the solution of matrix games in the absence of linear 
utility functions. 


Consider first the case of a game that is going to be played one time 
only. For this case, we shall formlate solutions that are approximately 
1. We have not discussed such schemes so far, but the “simplex method" 
may be regarded as a general algorithm for solving this class of zames. 
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optimal under the following criteria; 


Criterion A: That mixed strategy is best for player I which 
maximizes the probability that the real-comaodity payoff will 
lie in any one of a finite number of given, disjoint inter- 
vals, assuming that player II acts so as to minimize this 


probability. 


Criterion B: That mixed strategy is best for player I which 
i) maximizes the probability that the real-commodity 
payoff will exceed a given amount b, subject to the condition 
that 
ii) the probability that the payoff is less than a given 
amount c, shall be less than a given figure a, no matter what 
strategy player II uses. Assume that player II wishes to min- 


imize the probability that player I receives more than b. 


These are certainly not the only criteria one could consider. How- 
ever, they have features which are of practical value. Three obvious 
special cases of Criterion A call for 

(1) maximizing the probability that the real payoff exceeds a given 
value (perhaps some "critical" value), 

(2) minimizing the same probability, 

(3) maximizing the probability that the payoff lies in some specified 
interval. 

Certainly one can think of practical situations in which one of these 
might be a very reasonable course of action to follow. What is more impor-~- 
tant, it is not hard to imagine that a military commander might have just 
enough information to decide on one or two "critical" values, when a utility 
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measure over &11 pessible outcomes would be impossible to find. As an exam- 
ple, the commander who decides, "I mist at all costs destroy at least 80% 
of the enemy fleet in this engagement", has adopted special case (1) above 
as his criterion for an optimal course of action. A more homely example 
would be AR expert poker player who decides, "I want to win at least twenty 
dollars in this game, but I'd better not win more than fifty, or these 
people might not let me fleece them in the future." The poker expert is 
looking toward special case (3). 

Obviously, Criterion A is worth considering. The general formulation 
is no more difficult than the special cases, as will be seen’. 

It is a little harder to read sense into Criterion B at first glance, 
but an example will make it clear that it too has a plausible, common-sense 
basis. Suppose a businessman has the following thoughts: "I would like 
very much to make a profit of $100,000 or more this year. ($100,000 = 
given amount b). On the other hand, I'll have to take some risks to do that. 
I have some debts outstanding, my son enters Harvard this September, and I 
want to vacation in Hawaii this year. If I show a profit of less than 
$10,000, things are going to be tough. ($10,000 = given amount ¢). Weill, 
I'll try for the big profit, but only if I can work it so that my chances 
of making less than $10,000 are small; say about 10%."" (10% = given figure 
a). 

One can easily construct examples with a military flavor, involving 
the value of an objective, and the risks and costs associated with achieving 
it. As with criterion A, the point is this: It is far more reasonable to 
expect our businessman in the example to somehow get hold of the figures 
$100,000, $10,000, and 10Z, than it is to expect him to construct his 
1. It is also convenient to solve this problem for any finite number of 


intervals, since solutions for all the more practical special cases follow 
immediately. 
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complete l.u.f, for money. As for asking the Joint Chiefs of Staff what 
their combined 1].u.f. for casualties in Southeast Asia might be ---one 
shudders at the thought. 

In connection with both Criteria, we have made the assumption, essent- 
fally, that player II's aims will be directly opposed to those of player I. 

»% Obviously, in many real situations this will not be true. When we deal 
with this difficulty, we get into the whole problem of non-zcro-sum games, 
definitions of stable solutions, etc. 

Now, in the absence of l.u.f.°s, or any measure of "true value" assoc- 
iated with real outcomes of a game, we lack the means for deciding whether 
or not a game is zero-sum. Hence, we should really consider the difficult 
non-zero-sum case. To get around this situation, we adopt the following 
point of view, which should be regarded as part of Criteria A and B, and 
therefore as an “input" to any solution stewing from then: 

The most conservative policy that player I can follow is to assume 
that player II is an “implacable foe". In practical situations where this 
is largely true, all will be well, and incorporation of this assumption 
will be largely justified. When this assumption is not true, player I will 
get at least slightly better results than if player II were "“iwplacably 
opposed", but will not be able to take maximum advantage of possible 
“community of interest" arrangements, etc. In this latter case, the pro- 
aiticela) Gatit ch we develop here probably will not be very good, and player I 
should exercise his ingenuity to analyze the special features of the part- 
icular situation, in order to find a better solution. In other words, a 
practical application of these ideas requires a realistic look at the con- 
text in which they are to be applied, and no good solution can be expected 
from their use in inappropriate circumstances. 
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5. Mathematical formalation of the solution under criterion A. 





We are still dealing with the case where the real payoff is one- 
dimensional, that is, expressible in terms of a single real commodity x. 
Following the previous notation, let Q and P be mixed strategies of 
players I and II respectively, F(x) the matrix defined on page 3, and Y 
the real payoff to player I resulting from choices of Q and P. Denote 
the disjoint intervals of Criterion A by I, = [a,,b,! i la, sb, ] Oe 


I lab. | - Then 


P [ ver, or YéI, or... ver | =. i dG(y) 
ay, 


n by 
d 
pa Pay a by equation 1. 


b n 


k 
q,P, [F,,(b,) - F,,(a,)} . 
k=1 1,j%1P3 \ dF, 5 (y) = 2, i,j ivj Les k 1921 
a 


Using matrix notation (see equation 2), re-write this as 
¢ ee e S 4 - | 
Q [Pe ) F(a) P+ Q' [F(b,) F(a,)] D cee 0 [FD ) #(a,)] P 
= Q'|F(b)) +... + F(b ) - F(a ) -.-.°F(@,)| P . 


In the last expression, denote the matrix [F(b,) +.0° F(a,,)| by H. This 
is a matrix of real numbers, completely determined by F(x) and the intervals 
I> which are given quantities. Therefore, Q'HP is an ordinary bilinear 
form, and under our assumption about the behaviour of player II, conservative 
optimal mixed strategies % and Py are defined by 

Q°HP,, = max min Q'HP . (eq. 7) 

Q P 
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The nice feature of this result is of course that all the machinery 
for finding a and a that one uses in connection with zero-sum games, is 
appropriately used here. The difference lies in the interpretation of H 
as a matrix whose elements come from probability calculations, rather than 
a utility function. 

In the special case of maximizing P [Y>c] where c is some "critical" 
value, the result is 

,00 -00 
Piy><] = \ dct) = >) aFyy(y) = D2 aypy lh - Fayed] - 
c i,j “c i,j 
To put this in the more convenient matrix notation, define La to 


be a m-by-n matrix each element of which is unity, and write 
Piy>c] = Ql. - F(cP . 
Optimal strategies Q and Po are defined by 
Q’ nn> Fc] P, = =a x Q' [17 Fed} P (eq. 8) 


An interesting sub-special case develops when the probability dis- 


tributions Fy of the matrix F(x) are degenerate, that is: 


Fi) = Q for x < hij 


= 1 for x> hij ° 


This corresponds to the case where the real-commodity payoff associated 
with the choice (i,j) of pure strategies is a fixed amount hy 4, rather 

than random. In this case the matrix [lon = F(c)| of equation 8 is a 
matrix in which each element is either zero or one. Choice of a simplified 


criterion certainly leads in this case to a simplified problem to be solved! 
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6. Mathematical formulation of the solution under Criterion B. 


The procedure for finding an optimal strategy under Criterion B is 
somewhat more complicated than it is under Criterion A. We use a linear 
programming approach [5], [6]. The problem turns owt to be of the 
"maximization" variety, which entails 

(1) maximizing a linear form 2 wsty in non-negative variables ty, 
subject to 

(2) a set of linear constraints (inequalities) on the variables t;. 

Once such a problem has been correctly formulated, the set} of val- 
ues of the variables t; that maximizes the linear form may be found 
(provided a set satisfying the constraints exists) by the "Simplex 
Method" [5] , 16] , which may be regarded as a general algorithm for solv- 
ing linear programming problems. In any particular case, the amount of 
computation involved may be enormous, so that a high-speed computer may 
be needed to get actual numerical results. Conceptually however, the 
solution is always computable when it exists. 

In Appendix 5 of their book "Games and Decisions", Luce and Raiffa 
give an excellent formulation of the ordinary two-person zero-sum game 
solution as a linear progremming problem. The problem here turns out to 
be of a very similar form, so the reader might wish to refer to Luce and 
Raiffa before going on. 

Using the same notation as before, the solution is developed in 
nine steps as follows: 


(1) Let F(b) have dimensions m x n, and consider the matrix equation 


F(b) Q = B. 


1. Not necessarily unique. 
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fhe n x 1 matrix B is defined by this equation, and censists of elements 


y (j = 1,2,..,n) with 
m 
B, = = q,F, 0) , 


It is easy to see that B is the probability that Y < b, when player I 
uses mixed strategy Q and player II uses his ‘en pure strategy. Note that 
| 

| B,=0 for all j. 


(2) Define Bio = 2h B, ° 


Player I wishes to maximize P [Y>b] , and therefore to minimize P [y¢b] . 
Hence, under our assumption about the behavior of player II which was pre- 
viously stated, the objective of player I becomes: choose Q 80 as to min- 
imize B 0° 


j 


(3) Define the m x 1 matrix U = (1/B;,)Q , and define 1, to be an 


n x 1 matrix, each element of which is unity. Then we have 


(1/3 


a 
and consequently, 


F(b)'usZ 1. 
n 


(4) We can denote the matrix U by U' = (¥),t9,.-,U,), where 





oe (4, /B,,) , Sha 1l,2,.22,m . 


| Note that u,=0 for all i. The “, are the non-negative variables of this 


linear programming problem. 
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(5) Next, note thac 


Ja = ae = (q,/By = 1/B, | ; 


This can also be written as 


l‘u = 1/B - 
m Jo 


where Ls is an m x 1 matrix of one's. Since player I wishes to minimize 


By , he therefore will maximize 1/B . which is equivalent to maximizing 
oO 


j 
the linear form 1'U . 

(6) Thus, 1 is the linear form in non-negative variables to be max- 
imized, and from step (3) above, the matrix equation F(b)'U<1, represents 
a set of linear constraints. 

(7) There is, however, another set of linear constraints to be con- 
sidered. Recall that under Criterion B, we demand P [yecle a, against 
each pure strategy of player II. The matrix equation which expresses this 
requirement is 

F(c)'Q<a i. 


We need to include this equation emong the constraints. To do this, re- 


call that 
3 6 
DAs lo Le 
Therefore, 
¢ 8 
F(c)'QQ/B, ) <a Liu), 
or 


t Zz ’ 

F(c) veal. U ’ 
where 1 on is anm x n matrix, each element of which is unity. Now define 
O, to be an n x 1 watrix of zeros, and rewrite the last equation as 
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[ =yta , a 
iF(e)'- a | U =e é 


(5) Thus far, then, we have the problem 
maximize a ur 
subject to 


F(b)'UZ1 and ([F(c)' -al' |]uZo . 
n mn n 


The two equations expressing the constraints can be combined into one 


_matrix equation. Define two partitioned matrices A and C as follows; 








F(b)' | 1 
es = eS @peeeeaeeene dd © @ and C = eee @ os 
F(c)" =9a, 1 | | 0, | 


(9) The linear programming problem may now be written in final form 

as 
maximize 1) U , subject to A' UEC 

Of course, once U has been determined, player I's optimal strategy 
is found by computing 1/B,, = mm u; , 80 that optimal = Bio Us 

In conclusion, note that under Criterion B all the information neces- 
sary to construct the matrix A is assumed given, so that only the comput- 
ational problem remains. Furthermore, one can determine whether or not 
a solution exists by computing min max Q' F(c) P. If this quantity is 

Q P 


greater than a, it is obvious that there can be no solution; conversely, 


if it is smaller than a, there must be a solution. 


7. Extension to the case of a real payoff which is not one-dimensional. 


In the previous sections we considered games whose outcomes were measur- 
able in terms of a single real commodity, e.g., money. Let us now consider 
a case which, if not completely general, surely possesses enough general- 
ity to be of practical use. 
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Suppose that 4 géme can result in any one of © different outcomes, or 
"states" as we shall call them. Denote these states by S; nas eee oe 


S, must be distinguishable from 5, , for all if j. 


j 


Next, suppose that player I can make a complete and transitive order- 


ing of the states S. in terms of preference, where the word is used in its 


J 


everyday sense. By this we mean 


(1) For any two states S, and S, , i# j , player I either prefers 


Fy to ° » or vice versa. 
(2) If player I prefers S; to Sy and § 
to Sk : 


When player I is indifferent between S; and S, i.e,, has equal pre- 


; to Sy. » then he prefers Sy 


ference for them, we interpret this to mean that the relations “S; pre- 
ferred to S43" and “Ss, preferred to Ss," hold simultaneously. However, in 
order to simplify notation, we shall assume a strict ordering exists, with 
no cases of indifference, This assumption does not affect the validity of 
what follows; the case of indifference can be included at the expense of 
some extra symbols. 

Now, let the states be re-numbered in ascending order of prefercace, 
that is, S} is the least preferred state, Sy the next-to-least preferred, 
and so on until S, is the most preferred. During this process, we retaia 
the identity of each state, that is, we still know what real outcome each 


of the symbols S, represents. 


j 
Next we define S to be the ordered set S = {S; » So eS) o£ 


the re-numbered states. It is important to note thet the elements of § 
are disjoint. 

The purpose of all this is to lead to the following function. We 
define X as a function with domainS and range the set of integers 
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jel 2 een | , such that 
X 
SF ee for all S4E5. 

The function X is of course a random variable defined on the "sample 
space" S. Inasmuch as the probability distribution over S is Scemne for 
each choice (i,j) of pure strategies, the distribution of X is also known. 
Let us illustrate this fact. Suppose that corresponding to the choice 
(i,j) of pure strategies, the distribution of the outcome was 


eu [ outcome is s = Py for all S,.€5 


x O elsewhere. 
The corresponding distribution of X is given by 


Pay [x=k] = p, fork = 1,2,...,r 


= 0 elsewhere. 

Hence, given the matrix F(S,), k = 1,2,...,r, Of distribution functions 
(which corresponds to the matrix F(x) of previous sections), we can inmed- 
iately construct the matrix F*(k), k= 1,2,...,¥%, the elements of which 
are the distribution functions of X, corresponding to the pairs (i,j) of 
pure strategies. 

Since we ere dealing with a discrete random variable, we can also con- 
struct the matrix of probability mass functions which corresponds to F*(k). 
Denote this matrix of mass functions by J(k). 

Now we can repeat essentially all the computations of previous sections. 
For instance, if Q and P are the mixed strategies of the players, then (in 


terms of player I's preferences) 


l. By hypothesis; see p. 2. 
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P | outcome is preferred to S| = Pp | outcorme is Sia] or 
S449 O8...or s-J = PLx= g+i] + Pix = ga] +... + Pix eq] 
= Q' J(j+1) P+ Q' 5(j+2) P+ ...+ Q’ I(r) P 
= Qt | J(jtl) + S(j+2) +... + 5(r)] P. 

Another example: 


P | outcome is either s. or S, OF....0r SJ ae 


j 

Pix = i] + P[X = j] +....+ P[X =k] = 

Q’ [35(i) + JC§) + 2... T0kK)] P. 

We shall not pursue the details, but it should be clear, and the reader 
can easily verify, that everything goes through as before, including the 
solutions under Criteria A and B. 

Note that we have said nothing about the nature of the states S;: 
They may be defined in any way at all, quantitatively or qualitatively. 

We have required only that they be distinguishable, and that player I must 


have a transitive preference ordering over them. 


8. Some conments concerning the distribution function G(y). 


In section 2 we derived the distribution function 
— 
G(y) = /_ @,P,F,,(y) = Q'F(y)P . 
i ie 


It seems worthwhile to point out that G(y) represents a special case 
of an interesting class of distribution functions called "mixture distrib- 
utions", For an excellent discussion of this type of function, and some 
of its history, see Lz. Generally speaking, a distribution of this type 
arises when a sample is to be drawn from one of several populations, but 
only a probability statement can be made as to exactly which one. 


To illustrate one of the interesting properties of mixture distributions, 
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let us find the mean of G(y). 


oo oo 





E(Y) = 830) = a GPs i Se om Ty P 5M > 
where my * E(X; 4). If M = I ¢m, 5 | , then 
E(Y) = Q'MP, 
It is easy to show that, if M’ = |(ecx? ») | » then 


E(Y") = Q'M™’P 


Consequently, the variance of G(y) is given by 
Z 
Var(Y) = E(Y*) - E“(yY) = o'N-P -m . 


One may wish to ask, “If the distribution of Y can be written in terms 
of the distributions Pay as shown, what is the functional relationship be- 
tween Y and the Xq 5?" This can lead to some confusion unless it is real- 


ized that Y is defined by the equation G(y) = P Ly < y}] = Q'F(y)P . No 


other relation between Y and the Xs 5 is needed. 
9. Conclusion. 


There are two major conclusions which, it is hoped, the reader will 
have drawn from this paper. They are 

(1) A major practical stumbling block in the application of Game 
Theory to real problems is the unavailability of linear utility functions. 
We have seen how necessary a knowledge of these functions is, when it comes 
to making actual computations in order to solve a matrix game in the "usual" 
sense. 

(2) Despite this difficulty, there are reasonable procedures which can 
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be adopted, provided circumstances varrant, in order to vefine and compute 
“optimal'’ solutions. 

This paper is intended only to illustrate some ideas, and not to make 
a complete analysis of any particular class of solutions. We hope the 
reader will agree that the general scheme discussed here has possibilities 


for use in many situations other than the ones we have looked at. 
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APPENDIX A 


AXIOMATIC DEVELOPMENT OF THE LINEAR UTILITY FUNCTION 


Chapter 2 of "Games and Decisions" by Luce and Raiffa gives an ex- 
cellent presentation of this subject, and the reader is referred to 
their book for a detailed discussion of the axioms which follow. 

Definitions and notation. Assume all gambles or "lotteries" under 
discussion involve a finite set of prizes or alternatives, 

A= { Ay, Ayr seees A } . A simple lottery is defined to be a chance 


r 


mechanism which associates with each alternative A, a known probability 
P, = Loan» ; Cm) Ue SRE pee Pes 2 Py = 1. A simple lottery L is thus a 
probability distribution over A, and may also be regarded as an r-tuple, 
L bead {P)A), P2A,; ceoy pA} > 

A compound lottery is a two-stage chance mechanism, which, with 
probability q,;, yields a simple lottery p(t) as the result of its first 
(i) 


Stage. (€i@= 1,2,..38; te = 1). The simple lottery L then cone 
stitutes the second stage. We symbolize a compound lottery by 
fs mt, Si, asi ‘*)] », an s-tuple. Note that aC is again no more 
than a probability distribution over A, where P, (Ay) = = ‘ph 
The purpose is to derive the utility function of an individual. 
When we speak of preferences, they are the preferences of that individual. 
The symbol ? expresses a preference ordering. That is, A; * Ay 
means that the individual prefers A; to A, or is indifferent between 
them, and L > L' or L2f expresses the same relationship between two 
lotteries. If Ay Ay and Aye Ay hold simultaneously, we write Ay~ Ay. 
The symbol ~ denotes indifference between the two alternatives. 
Throughout what follows, it is assumed that the set A has been 


ordered so that A, is the most preferred alternative, and so on down, 
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until A, is the least preferred. This can be done with no loss of gener- 


ality. 


Axiom 1. The preference ordering £ » @ binary relation, yields a complete 


and transitive ordering of A. 


"Complete" means, for any A; and A, in A, either At As or a,+ Ay> 


Axiom 2. Let J be a compound lottery and L a simple lottery. If 


P, (4,) = Py (4,) for all A; in A, then {~ L. 


This says that it is only the character of a lottery as a probability 


distribution which interests the individual. 


Axiom 3. Each alternative A, is indifferent to some simple lottery 


which mixes only A_, the best alternative, and A.» the worst alternative. 


1 
That is, for every A; in A, there exists some probability u;, such that 


Ay~{ugay, (1 - uy)Ay] - 
Axiom 4. If A,~ (uj, (1 - u,; )A_| , then in any simple lottery 


L = (pyAj, «oe, PyAys coos PréAy), pyA, may be replaced by py LugAy, (1 - uy )Ar]. 


Axiom 5. The relations + and ~ are transitive among lotteries, just ase 


they are among the elements of A. 


This is obviously desirable, but it cannot be deduced from the pre- 


ceding axioms, and so must be included separately. 


Axiom 6. The relation IPA, (1 - p)A__| 2 [p'Ay> (1 - p')A,| holds if 


and only if pp’. 


These six axioms give us the following theorem, which is the point 
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of the entire development: 


Theorem. If the relation 2 satisfies axioms 1 through 6, then there 


exist numbers u, associated with the elements A, of A, such that for 


1 
any two lotteries L and L', the relation L > L’ holds if and only if 


Yr 


r 
D_ ¥yPy (Ay) D> uyP,"(A,) 
i=1 i=1 


The proof is obvious from the axioms. Any compound lottery can be 
reduced to a simple lottery involving Ajreees A,, which can in turn be 
converted to another compound lottery involving just Ay and A,, which can 
be reduced to a simple lottery involving A, and A,, which can by axiom 6 
be compared with any lottery so treated. The "experimental" part of the 
program lies in finding the numbers u; of Axiom 3. These first appear 
in the guise of probabilities, and later turn out to be the utilities 
which we sought. It is easy to show that any positive linear transfor- 
mation of these numbers again yields a utility function which reflects 
the individual's preferences. 

The reader who has followed through this derivation should easily 


be able to verify the assertions made in section 3 of this paper. 
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APPENDIX B 


A NON-RIGOROUS EXTENSION OF THE LINEAR UTILITY CONCEPT. 


In Appendix A, the assumption was made that the set A of alternatives 
associated with any lottery L was finite, and that the lottery comprised 
a probability mass function PD defined on A. Suppose now we wish to 
enter a gamble G involving some numerically measurable and infinitely 
divisible commodity x. Associated with G there is a probability density 
function f£,(x), defined over x. 

We want to make all the logic and conclusions of Appendix A valid 
for this case also, so we take the following position. Ali "continuous" 
gambles discussed in this paper are such that 

(1) If I is the interval over which x is defined, I may be parti- 
tioned into a finite number of disjoint subintervals dx;, whose union 
is I. If X is the random variable of the gamble, P(X é€ dx; | = fg (xy dx, ; 
where x,;€1x,. 


(2) By proper choice of dx, and x,;, we can approximate £q(x) as 


i 
closely as we like by a probability mass function f(x, dx; . 


Once this has been done, all the logic of Appendix A will go through, 





and the numbers u, can be found. We now assume that the set of numbers 

u; thus produced can be represented, as dx,;>O0 and the number of subinter- 
vals increases without limit, by a continuous curve u(x). when this is 
the case, it is intuitively clear that the continuous expected value 
calculation will have the same characteristics ag the discrete calculation 


in Appendix A. 
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